Faster LLM Inference with No Accuracy Loss